Thread-local storage (TLS) is a computer programming method that uses static or global memory local to a thread.
This is sometimes needed because normally all threads in a process share the same address space, which is sometimes undesirable. In other words, data in a static or global variable is normally always located at the same memory location, when referred to by threads from the same process. Variables on the stack however are local to threads, because each thread has its own stack, residing in a different memory location.
Sometimes it is desirable that two threads referring to the same static or global variable are actually referring to different memory locations, thereby making the variable thread local, a canonical example being the C error code variable errno
.
If it is possible to make at least a memory address sized variable thread local, it is in principle possible to make arbitrarily sized memory blocks thread local, by allocating such a memory block and storing the memory address of that block in a thread local variable.
Contents |
The API function TlsAlloc can be used to obtain an unused TLS slot index; the TLS slot index will then be considered ‘used’.
The TlsGetValue and TlsSetValue functions can then be used to read and write a memory address to a thread local variable identified by the TLS slot index. TlsSetValue can only affect the variable for the current thread.
The TlsFree function can be called to release the TLS slot index; the index will then be considered ‘unused’ and a new call to TlsAlloc can return it again.
TLS with POSIX threads (Thread-Specific Data in Pthreads nomenclature) is similar to TlsAlloc and related functionality for Windows. pthread_key_create creates a key, with an optional destructor, that can later be associated with thread specific data via pthread_setspecific. The data can be retrieved using pthread_getspecific. If the thread specific value is not NULL, the destructor will be called when the thread exits. Additionally, key must be destroyed with pthread_key_delete.
Apart from relying on programmers to call the appropriate API functions, it is also possible to extend the programming language to support TLS.
In Delphi or Free Pascal one can use the 'threadvar' reserved keyword instead of 'var' to declare variables using the thread-local storage.
var mydata_process: integer; threadvar mydata_threadlocal: integer;
In Java thread local variables are implemented by the ThreadLocal
class object. ThreadLocal holds variable of type T, which is accessible via get/set methods. For example ThreadLocal variable holding Integer value looks like this:
private static ThreadLocal<Integer> myThreadLocalInteger = new ThreadLocal<Integer>();
C++11 introduces the thread_local
[1] keyword which can be used in the following cases
Aside from that, various C++ compiler implementations provide specific ways to declare thread-local variables:
__thread int number;
__declspec(thread) int number;
int __thread number;
On Windows versions prior to Vista and Server 2008, __declspec(thread)
works in DLLs only when those DLLs are bound to the executable, and will not work for those loaded with LoadLibrary() (a protection fault or data corruption may occur).[3]
In the D programming language version 2, all static and global variables are thread-local by default and are declared with syntax similar to "normal" global and static variables in other languages. Regular global variables must be explicitly requested using the __gshared keyword:
int threadLocal; // This is a thread local variable. __gshared int global; // This is a plain old global variable.
Static fields can be marked with the ThreadStatic attribute:
class FooBar { [ThreadStatic] static int foo; }
In .NET 4.0 the System.Threading.ThreadLocal<T> class is available for allocating and lazily loading thread local variables.
class FooBar { private static System.Threading.ThreadLocal<int> foo; }
Also an API is available for dynamically allocating thread local variables.
Common Lisp provides a feature called dynamically scoped variables.
Dynamic variables have a binding which is private to the invocation of a function and all of the children called by that function.
This abstraction naturally maps to thread-specific storage, and Lisp implementations that provide threads do this. Common Lisp has numerous standard dynamic variables, and so threads cannot be sensibly added to an implementation of the language without these variables having thread-local semantics in dynamic binding.
For instance the standard variable *print-base*
determines the default radix in which integers are printed. If this variable is overridden, then all enclosing code will print integers in an alternate radix:
;;; function foo and its children will print ;; in hexadecimal: (let ((*print-base* 16)) (foo))
If functions can execute concurrently on different threads, this binding has be properly thread local, otherwise thread will fight over who controls a global printing radix.
In Python version 2.4 or later local class in threading module can be used to create thread-local storage.
import threading mydata = threading.local() mydata.x = 1
In Ruby thread local variables can be created/accessed using []=/[] methods.
Thread.current[:user_id] = 1
In Perl threads were added late in the evolution of the language, after a large body of existing code was already present on the Comprehensive Perl Archive Network. As a result, threads in Perl by default take their own local storage for all variables, to minimise the impact of threads on existing non-thread-aware code. In Perl, a thread-shared variable can be created using an attribute:
use threads; use threads::shared; my $localvar; my $sharedvar :shared;
The above discussion indicates what interface a programmer uses to obtain thread-local storage, but not how this works behind the scenes. The underlying problem is that, since all threads share an address space, no fixed memory location can be used to store the location of the storage. The following discussion applies to Microsoft Windows-based systems, but similar models may be applicable to other systems.
In Windows, the thread-local storage is accessed via a table. (Actually, two tables, but they appear as one.[4]) TlsAlloc returns an index to this table, unique per address space, for each call. Each thread has its own copy of the thread-local storage table. Hence, each thread can independently use TlsSetValue(index) and obtain the same value via TlsGetValue(index), because these set and look up an entry in the thread's own table. Only a single pointer is stored; any Windows system which offers more than one pointer of storage is either allocating multiple values or, more likely, obtaining storage from heap or stack and storing that in the pointer.
This leaves the question of how a per-thread table is to be found. In Windows, there is a Thread Information Block for each thread. One of the entries in this block is the thread-local storage table for that thread.[5] On x86 systems, the address of the Thread Information Block is stored in the FS register. In this way, access to thread-local storage carries a minimal overhead.